173 results found.
Language Type:
Multilingual
Languages:
Basque Catalan English Galician Portuguese
Availability:
From Owner
License:
<Not Specified>
Size:
14 GByte
Production Status:
Newly created-finished
Use:
Language Identification
- Paper title: KALAKA-2: a TV Broadcast Speech Database for the Recognition of Iberian Languages in Clean and Noisy Environments
- Paper track: Speech
- Paper status: Accept Oral
| Author Number | Name | Affiliation | Country | Affiliation 2 | Country 2 |
|---|---|---|---|---|---|
| Author 1 | Luis Javier Rodríguez-Fuentes | University of the Basque Country | None | Euskal Herriko Unibertsitatea | None |
| Author 2 | Mikel Penagarikano | University of the Basque Country | None | Euskal Herriko Unibertsitatea | None |
| Author 3 | Amparo Varona | University of the Basque Country | None | | |
| Author 4 | Mireia Diez | University of the Basque Country | None | | |
| Author 5 | German Bordel | University of the Basque Country | None | University of the Basque Country | ES |
| Main Contact | Luis Javier Rodríguez-Fuentes | University of the Basque Country | ES | University of the Basque Country UPV/EHU | ES |
Documentation:
<Not Specified>
Written
Corpus,
Language Type:
Multilingual
Languages:
Basque Catalan Galician Portuguese Spanish
Availability:
Freely Available
License:
Creative Commons
Size:
65K sentences
Production Status:
Newly created-finished
Use:
Machine Translation, SpeechToSpeech Translation
- Paper title: TweetMT: A Parallel Microblog Corpus
- Paper track: Evaluation
- Paper status: Accept Poster
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Author 1 | Iñaki San Vicente | Elhuyar Foundation / IXA - UPV-EHU | ot |
| Author 2 | Iñaki Alegria | University of the Basque Country (UPV/EHU) | ES |
| Author 3 | Cristina España-Bonet | Universitat Politècnica de Catalunya -- BarcelonaTech | ES |
| Author 4 | Pablo Gamallo | CITIUS, University of Santiago de Compostela | ES |
| Author 5 | Hugo Gonçalo Oliveira | CISUC, University of Coimbra | PT |
| Author 6 | Eva Martinez Garcia | TALP Research Center | ES |
| Author 7 | Antonio Toral | Dublin City University | IE |
| Author 8 | Arkaitz Zubiaga | University of Warwick | GB |
| Author 9 | Nora Aranberri | University of the Basque Country | ES |
| Main Contact | Iñaki San Vicente | Elhuyar Foundation / IXA - UPV-EHU | None |
Documentation:
<Not Specified>
Written
<Not Specified>,
Language Type:
Multilingual
Languages:
Dutch English Portuguese Spanish
Availability:
Freely Available
License:
Open Source
Size:
<Not Specified>
Production Status:
Newly created-in progress
Use:
Language Identification
- Paper title: VarClass: An Open-source Language Identification Tool for Language Varieties
- Paper track: Written
- Paper status: Accept Poster+DemoSuggested
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Author 1 | Marcos Zampieri | University of Cologne | GB |
| Author 2 | Binyam Gebre | MPI for Psycholinguistics | NL |
| Main Contact | Marcos Zampieri | University of Wolverhampton | None |
Documentation:
<Not Specified>
Written
Corpus,
Language Type:
Multilingual
Languages:
Portuguese Italian
Availability:
Not Applicable
License:
<Not Specified>
Size:
<Not Specified> <Not Specified>
Production Status:
Newly created-in progress
Use:
Corpus Creation/Annotation
- Paper title: Designing A Long Lasting Linguistic Project: The Case Study of ASIt
- Paper track: Infrastructural Issues/Large Projects
- Paper status: Accept Poster+Demo
| Author Number | Name | Affiliation | Country | Affiliation 2 | Country 2 | Affiliation 3 | Country 3 |
|---|---|---|---|---|---|---|---|
| Author 1 | Maristella Agosti | <Not Specified> | None | University of Padua | None | University of Padua | IT |
| Author 2 | Emanuele Di Buccio | University of Padua | IT | | | | |
| Author 3 | Giorgio Maria Di Nunzio | University of Padua | IT | | | | |
| Author 4 | Cecilia Poletto | Goethe Universität Frankfurt am Main | DE | | | | |
| Author 5 | Esther Rinke | Goethe Universität Frankfurt am Main | DE | | | | |
| Main Contact | Giorgio Maria Di Nunzio | University of Padua | None | | | | |
Documentation:
<Not Specified>
Graphical presentation of terminological relations
Terminology,
Language Type:
Trilingual
Languages:
English Portuguese French
Availability:
Freely Available
License:
<Not Specified>
Size:
30000 Other
Production Status:
Newly created-in progress
Use:
Knowledge Discovery/Representation
- Paper title: Browsing the Terminological Structure of a Specialized Domain: A Method Based on Lexical Functions and their Classification
- Paper track: Terminology
- Paper status: Accept Poster
| Author Number | Name | Affiliation | Country | Affiliation 2 | Country 2 |
|---|---|---|---|---|---|
| Author 1 | Marie-Claude L'Homme | OLST, University of Montreal | CA | | |
| Author 2 | Benoît Robichaud | OLST, University of Montreal | CA | OLST/Université de Montréal | CA |
| Author 3 | Nathalie Prévil | OLST/Université de Montréal | CA | | |
| Main Contact | Marie-Claude L'Homme | OLST, University of Montreal | None | | |
Documentation:
Available soon
Written
Lexicon,
Language Type:
Multilingual
Languages:
Portuguese
Availability:
Freely Available
License:
<Not Specified>
Size:
211000 <Not Specified>
Production Status:
Newly created-in progress
Use:
Language Modelling
- Paper title: The Common Orthographic Vocabulary of the Portuguese Language: a set of open lexical resources for a pluricentric language
- Paper track: General issues
- Paper status: Accept Poster+Demo
| Author Number | Name | Affiliation | Country | Affiliation 2 | Country 2 | Affiliation 3 | Country 3 |
|---|---|---|---|---|---|---|---|
| Author 1 | José Pedro Ferreira | ILTEC | None | | | | |
| Author 2 | Maarten Janssen | Universidad Pompeu Fabra | None | Universitat Pompeu Fabra, Instituto Universitario de Lingüística Aplicada | None | IULA | None |
| Author 3 | Gladis Barcellos de Oliveira | NILC | None | | | | |
| Author 4 | Margarita Correia | ILTEC | None | | | | |
| Author 5 | Gilvan Müller de Oliveira | Instituto Internacional da Língua Portuguesa - IILP | None | | | | |
| Main Contact | José Pedro Ferreira | ILTEC | PT | | | | |
Documentation:
<Not Specified>
Written
Corpus,
Language Type:
Multilingual
Languages:
Portuguese
Availability:
Freely Available
License:
Creative Commons Attribution 4.0 International License
Size:
768 definite descriptions Other
Production Status:
Newly created-finished
Use:
Natural Language Generation
- Paper title: Reference production in human-computer interaction: Issues for Corpus-based Referring Expression Generation
- Paper track: Written
- Paper status: Accept PosterMerged
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Author 1 | Danillo Rocha | University of São Paulo | BR |
| Author 2 | Ivandré Paraboni | University of São Paulo | BR |
| Main Contact | Ivandré Paraboni | University of São Paulo | None |
Documentation:
see readme.txt
Written
Corpus,
Language Type:
Multilingual
Languages:
Portuguese
Availability:
Freely Available
License:
CC BY 4.0
Size:
4888 sentences
Production Status:
Newly created-finished
Use:
Sentence Readability Assessment
- Paper title: A Nontrivial Sentence Corpus for the Task of Sentence Readability Assessment in Portuguese
- Paper track: Resource paper
- Paper status: Accept Poster
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Author 1 | Sidney Evaldo Leal | University of Sao Paulo, Institute of Mathematics and Computer Sciences | BR |
| Author 2 | Magali Sanches Duran | University of Sao Paulo, Institute of Mathematics and Computer Sciences | N/A |
| Author 3 | Sandra Maria Aluísio | University of Sao Paulo, Institute of Mathematics and Computer Sciences | N/A |
| Main Contact | Sidney Evaldo Leal | University of Sao Paulo, Institute of Mathematics and Computer Sciences | None |
Documentation:
https://github.com/sidleal/porsimplessent
Speech/Written
Corpus,
Language Type:
Multilingual
Languages:
Portuguese
Availability:
Freely Available
License:
CC - BY - NC - SA
Size:
2.9 GByte
Production Status:
Newly created-finished
Use:
Disfluencies detection
- Paper title: HESITA(te) in Portuguese
- Paper track: Speech
- Paper status: Accept Oral
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Author 1 | Sara Candeias | Instituto de Telecomunicações | PT |
| Author 2 | Dirce Celorico | Instituto de Telecomunicações | PT |
| Author 3 | Jorge Proença | Instituto de Telecomunicações | PT |
| Author 4 | Arlindo Veiga | Instituto de Telecomunicações | PT |
| Author 5 | Carla Lopes | Instituto de Telecomunicações | PT |
| Author 6 | Fernando Perdigão | University of Coimbra | PT |
| Main Contact | Jorge Proença | Instituto de Telecomunicações | None |
Documentation:
Yes (English)
Language Type:
Trilingual
Languages:
English Portuguese Spanish
Availability:
Freely Available
License:
<Not Specified>
Size:
<Not Specified> <Not Specified>
Production Status:
Existing-used
Use:
Machine Translation, SpeechToSpeech Translation
- Paper title: The Scielo Corpus: a Parallel Corpus of Scientific Publications for Biomedicine
- Paper track: Written
- Paper status: Accept Poster
| Author Number | Name | Affiliation | Country | Affiliation 2 | Country 2 |
|---|---|---|---|---|---|
| Author 1 | Mariana Neves | Hasso-Plattner Institut | DE | | |
| Author 2 | Antonio Jimeno Yepes | IBM Research Australia | AU | | |
| Author 3 | Aurélie Névéol | LIMSI, CNRS | FR | | |
| Main Contact | Mariana Neves | Hasso-Plattner Institut | None | German Federal Institute for Risk Assessment | None |
Documentation:
<Not Specified>




